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4 
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7 
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W--> 27 

29 
30 
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34 
35 



Input Set : A:\Sequence.List.for.l0817530.txt 
Output Set: N:\CRF4\10052004\J817530.raw 

<110> APPLICANT: Braun, Werner 
Mathura, Venkatarajan S. 
Schein, Catherine H. 

<120> TITLE OF INVENTION: PHYSICAL -CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS 

REGARDING SAME 
<130> FILE REFERENCE: 265.00400101 
<140> CURRENT APPLICATION NUMBER: 10/817,530 
<141> CURRENT FILING DATE: 2004-04-02 
<150> PRIOR APPLICATION NUMBER: US 60/460,769 

<151> PRIOR FILING DATE: 2003-04-04 v 
<160> NUMBER OF SEQ ID NOS : 5 ^%AAJJ^\ 
<170> SOFTWARE: Patentln version 3.2 jv Sa_^J^ /fT^^^^ 

<210> SEQ ID NO: 1 . * - . v ^fU^JL- J) *\r H ^ 



0068 Not Comply 
Corrected Diskette Needer 



IALj;MEM 

A 



<211> LENGTH: 10 
<212> TYPE: PRT 
<213> ORGANISM: ARTIFICIAL] 
<220> FEATURE: 
<223> OTHER INFORMATION 
<400> 1 

Pro Asp lie Leu Cys Leu Gin Glu Thr Lys 

15 10 

<210> SEQ ID NO: 2 

<211> LENGTH: 275 

<212> TYPE: PRT 1 



EMBER OF DNase-I SUPERFAMILY 





C- -> 


36 


<213> ORGANISM: 


ARTI F I C I AL JMEMBER OF DNase-I SUPERFAMILY 




w--> 


38 


<220> FEATURE: 
























w- -> 


38 


<223> OTHER 


INFORMATION 




















w--> 


38 


<400> 2 
































40 


Leu 


Tyr 


Glu 


Asp 


Pro 


Pro 


Asp 


Gin 


Lys 


Thr 


Ser 


Pro 


Ser 


Gly 


Lys 


Pro 




41 


1 








5 










10 










15 






44 


Ala 


Thr 


Leu 


Lys 


He 


Cys 


Ser 


Trp 


Asn 


Val 


Asp 


Gly 


Leu 


Arg 


Ala 


Trp 




45 








20 










25 










30 








48 


He 


Lys 


Lys 


Lys 


Gly 


Leu 


Asp 


Trp 


Val 


Lys 


Glu 


Glu 


Ala 


Pro 


Asp 


He 




49 






35 










40 










45 










52 


Leu 


Cys 


Leu 


Gin 


Glu 


Thr 


Lys 


Cys 


Ser 


Glu 


Asn 


Lys 


Leu 


Pro 


Ala 


Glu 




53 




50 










55 










60 












56 


Leu 


Gin 


Glu 


Leu 


Pro 


Gly 


Leu 


Ser 


His 


Gin 


Tyr 


Trp 


Ser 


Ala 


Pro 


Ser 




57 


65 










70 










75 










80 




60 


Asp 


Lys 


Glu 


Gly 


Tyr 


Ser 


Gly 


Val 


Gly 


Leu 


Leu 


Ser 


Arg 


Gin 


Cys 


Pro 




61 










85 










90 










95 






64 


Leu 


Lys 


Val 


Ser 


Tyr 


Gly 


He 


Gly 


Asp 


Glu 


Glu 


His 


Asp 


Gin 


Glu 


Gly 




65 








100 










105 










110 








68 


Arg 


Val 


He 


Val 


Ala 


Glu 


Phe 


Asp 


Ser 


Phe 


Val 


Leu 


Val 


Thr 


Ala 


Tyr 




fa hi 
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69 






115 










ion 










IOC 
J. Z D 








72 


Val 


Pro 


Asn 


Ala 


Cjiy 


Arg 


Cjj_y 


Leu 




Arg 


Leu 




lyr 


Arg 




Arg 


73 




130 










135 










± *± u 










76 


Trp 


Asp 


Glu 


Ala 


Phe 


Arg 


Lys 


Phe 


Leu 


Lys 


Gly 


Leu 


Ala 


Ser 


Arg 


Lys 


77 


145 










150 










155 










160 


80 


Pro 


Leu 


Val 


Leu 


Cys 


Gly 


Asp 


Leu 


Asn 


Val 


Ala 


His 


Glu 


Glu 


He 


Asp 


81 










165 










170 










175 




84 


Leu 


Arg Asn 


Pro 


Lys 


Gly 


Asn 


Lys 


Lys 


Asn 


Ala 


Gly 


Phe 


Thr 


Pro 


Gin 


85 








180 










185 










190 






88 


Glu 


Arg 


Gin 


Gly 


Phe 


Gly 


Glu 


Leu 


Leu 


Gin 


Ala 


Val 


Pro 


Leu 


Ala 


Asp 


89 






195 










200 










205 








92 


Ser 


Phe 


Arg 


His 


Leu 


Tyr 


Pro 


Asn 


Thr 


Pro 


Tyr 


Ala 


Tyr 


Thr 


Phe 


Trp 


93 




210 










215 










220 










96 


Thr 


Tyr 


Met 


Met 


Asn 


Ala 


Arg 


Ser 


Lys 


Asn 


Val 


Gly 


Trp 


Arg 


Leu 


Asp 


97 


225 










230 










235 










240 





100 


Tyr 


Phe 


Leu 


Leu 


Ser 


His 


Ser 


Leu 


Leu 


Pro 


Ala 


Leu 


Cys 


Asp 


Ser 


Lys 




101 










245 










250 










255 






104 


He 


Arg 


Ser Lys Ala Leu Gly Ser Asp His 


Cys 


Pro 


He 


Thr 


Leu 


Tyr 




105 








260 










265 










270 








108 


Leu 


Ala 


Leu 






























109 






275 






























llz 


<210> SEQ ID NO: 


3 


























113 


<211> LENGTH: 268 


























114 


<212> TYPE: 


PRT 


























C--> 


115 


<213> ORGANISM: 


ARTIFICIAL^ 


1EMBER OF DNase-I SUPERFAMILY 




w--> 


117 


<220> FEATURE: 






r 




















w--> 


117 


<223> OTHER 


INFORMATION: 




















w--> 


117 


<400> 3 
































119 


Met 


Lys 


Phe 


Val 


Ser 


Phe 


Asn 


He 


Asn 


Gly 


Leu 


Arg 


Ala 


Arg 


Pro 


His 




120 


1 








5 










10 










15 






123 


Gin 


Leu 


Glu 


Ala 


He 


Val 


Glu 


Lys 


His 


Gin 


Pro 


Asp 


Val 


He 


Gly 


Leu 




124 








20 










25 










30 








127 


Gin 


Glu 


Thr 


Lys 


Val 


His 


A sp 


Asp 


Met 


Phe 


Pro 


Leu 


Glu 


Glu 


Val 


Ala 




128 






35 










40 










45 










131 


Lys 


Leu 


Gly 


Tyr 


Asn 


Val 


Phe 


Tyr 


His 


Gly 


Gin 


Lys 


Gly 


His 


Tyr 


Gly 




132 




50 










55 










60 












135 


Val 


Ala 


Leu 


Leu 


Thr 


Lys 


Glu 


Thr 


Pro 


He 


Ala 


Val 


Arg 


Arg 


Gly 


Phe 




136 


65 










70 










75 










80 




139 


Pro 


Gly 


Asp 


Asp 


Glu 


Glu 


Ala 


Gin 


Arg 


Arg 


He 


He 


Met 


Ala 


Glu 


He 




140 










85 










90 










95 






143 


Pro 


Ser 


Leu 


Leu 


Gly 


Asn 


Val 


Thr 


Val 


He 


Asn 


Gly 


Tyr 


Phe 


Pro 


Gin 




144 








100 










105 










110 








147 


Gly 


Glu 


Ser 


Arg 


Asp 


His 


Pro 


He 


Lys 


Phe 


Pro 


Ala 


Lys 


Ala 


Gin 


Phe 




148 






115 










120 










125 










151 


Tyr 


Gin 


Asn 


Leu 


Gin 


Asn 


Tyr 


Leu 


Glu 


Thr 


Glu 


Leu 


Lys 


Arg 


Asp 


Asn 




152 




130 










135 










140 












155 


Pro 


Val 


Leu 


He 


Met 


Gly 


Asp 


Met 


Asn 


He 


Ser 


Pro 


Thr 


Asp 


Leu 


Asp 




156 


145 










150 










155 










160 




159 


He 


Gly 


He 


Gly 


Glu 


Glu 


Asn 


Arg 


Lys 


Arg 


Trp 


Leu 


Arg 


Thr 


Gly 


Lys 
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lb U 










J. O -> 










170 










175 






lb .3 


Cys 


Ser 


Phe 


Leu 


Dm 
tr L (J 


ul U. 


m n 

Ul u. 




fJl n 




Met 


Asp 


Arg 


Leu 


Met 


Ser 




lb4 








180 




















190 








167 


Trp Gly 


Leu 


Val 


Asp 


Thr 




Arg 


nlo 


r\.L CX. 


A d -n 


IT i. \J 


Gin 


Thr 


Ala 


Asp 




loo 






195 










200 










205 










1/1 


Arg 


Phe 


Ser 


Trp 


Phe 


Asp 


Tyr 


Arg 


Ser 


Lys 


Gly 


Phe 


Asp 


Asp 


Asn Arg 




1 /z 




210 










215 










220 












175 


Gly Leu 


Arg 


He 


Asp 


Leu 


Leu 


Leu 


Ala 


Ser 


Gin 


Pro 


Leu 


Ala 


Glu 


cys 




176 


225 










230 










235 














179 


Cys 


Val 


Glu 


Thr 


Gly 


He 


Asp 


Tyr 


Glu 


He 


Arg 


Ser 


Met 


Glu 


Lys 


Pro 




180 










245 










250 










255 






183 


Ser Asp 


His 


Ala 


Pro 


Val 


Trp 


Ala 


Thr 


Phe 


Arg 


Arg 












184 








260 










265 


















187 


<210> SEQ I'D NO: 


: 4 


























188 


<211> LENGTH: 258 


























189 


<212> TYPE: 


PRT 


























- > 


190 


<213> ORGANISM: 


ARTIFICIAIiA* 


1EMBER OF DNase-I SUPERFAMILY 




- > 


192 


<220> FEATURE: 








i 

J 


















- > 


192 


<223> OTHER 


INFORMATION : £r 




















- > 


192 


<400> 4 
































194 


Leu 


Lys 


He 


Ala 


Ala 


Phe 


Asn 


He 


Arg 


Thr 


Phe 


Gly 


Glu 


Thr 


Lys 


i v ie u 




195 


1 








5 










10 










15 






198 


Ser 


Asn 


Ala 


Thr 


Leu 


Ala 


Ser 


Tyr 


He 


Val. 


Arg 


He 


Val 


Arg 


Arg 


Tyr 




199 








20 










25 










30 








202 


Asp 


He 


Val 


Leu 


He 


Gin 


Glu 


Val 


Arg 


Asp 


Ser 


His 


Leu 


Val 


Ala 


val 




203 






35 










40 










45 










206 


Gly 


Lys 


Leu 


Leu 


Asp 


Tyr 


Leu 


Asn 


Gin 


Asp 


Asp 


Pro 


Asn 


Thr 


Tyr 


n x o 




207 




50 










55 










60 












210 


Tyr 


Val 


Val 


Ser 


Glu 


Pro 


Leu 


Gly 


Arg 


Asn 


Ser 


Tyr 


Lys 


Glu 


Arg 


y 




211 


65 










70 










75 










ft n 

o u 




214 


Leu 


Phe 


Leu 


Phe 


Arg 


Pro 


Asn 


Lys 


Val 


Ser 


Val 


Leu 


Asp 


Thr 


Tyr 


Gin 




215 










85 










90 










95 






218 


Tyr 


Asp 


Asp 


Gly 


Cys 


Cys 


Gly 


Asn 


Asp 


Ser 


Phe 


Ser 


Arg 


Glu 


Pro 


Ala 




219 








100 










105 










110 








*i n o 

zzz 


Val 


Val 


Lys 


Phe 


Ser 


Ser 


His 


Ser 


Thr 


Lys 


Val 


Lys 


Glu 


Phe 


Ala 


He 




22 3 






115 










120 










125 










9 9 £ 

Z Z D 


Val 


Ala 


Leu 


His 


Ser 


Ala 


Pro 


Ser 


Asp 


Ala 


Val 


Ala 


Glu 


He 


Asn 


Ser 




227 




130 










135 










140 












230 


Leu 


Tyr 


Asp 


Val 


Tyr 


Leu Asp 


Val 


Gin 


Gin 


Lys 


Trp 


His 


Leu 


Asn 


Asp 




231 


145 










150 










155 










160 




234 


Val 


Met 


Leu 


Met 


Gly 


Asp 


Phe 


Asn 


Ala 


Asp 


Cys 


Ser 


Tyr 


Val 


Thr 


Ser 




235 










165 










170 










175 






238 


Ser 


Gin 


Trp 


Ser 


Ser 


He 


Arg 


Leu 


Arg 


Thr 


Ser 


Ser 


Thr 


Phe 


Gin 


Trp 




239 








180 










185 










190 








242 


Leu 


He 


Pro 


Asp 


Ser 


Ala Asp 


Thr 


Thr 


Ala 


Thr 


Ser 


Thr 


Asn 


Cys 


Ala 




243 






195 










200 










205 










246 


Tyr 


Asp 


Arg 


He 


Val 


Val 


Ala 


Gly 


Ser 


Leu 


Leu 


Gin 


Ser 


Ser 


Val 


Val 




247 




210 










215 










220 












250 


Pro 


Gly 


Ser 


Ala 


Ala 


Pro 


Phe 


Asp 


Phe 


Gin 


Ala 


Ala 


Tyr 


Gly 


Leu 


Ser 
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251 



225 



230 



235 



240 



2 54 Asn Glu Met Ala Leu Ala He Ser Asp His Tyr Pro Val Glu Val Thr 
255 245 250 255 

258 Leu Thr 

262 <210> SEQ ID NO: 5 

263 <211> LENGTH: 336 

264 <212> TYPE: PRT \ 



C--> 265 <213> ORGANISM: ART I F I C I AL[;]MEMB ER OF DNase-I SUPERFAMILY 



W--> 267 


<400> 5 






























269 


Tyr 


Asp 


Pro 


He 


His 


Glu 


Tyr 


Val 


Asn 


His 


Glu 


Leu 


Arg 


Lys 


Arg 


Glu 


270 


1 








5 










10 










15 




273 


Asn 


Glu 


Phe 


Ser 


Glu 


His 


Lys 


Asn 


Val 


Lys 


He 


Phe 


Val 


Ala 


Ser 


Tyr 


274 








20 










25 










30 






277 


Asn 


Leu 


Asn Gly 


Cys 


Ser 


Ala 


Thr 


Thr 


Lys 


Leu 


Glu 


Asn 


Trp 


Leu 


Phe 


278 






35 






> 




40 










45 








281 


Pro 


Glu 


Asn 


Thr 


Pro 


Leu 


Ala 


Asp 


lie 


Tyr 


Val 


Val 


Gly 


Phe 


Gin 


Glu 


282 




50 










55 










60 










285 


He 


Val 


Gin 


Leu 


Thr 


Ser 


Ala 


Asp 


Pro 


Ala 


Lys 


Arg 


Arg 


Glu 


Tr P 


Glu 


286 


65 










70 










75 










80 


289 


Ser 


Cys 


Val 


Lys 


Arg 


Leu 


Leu 


Asn 


Gly 


Lys 


Cys 


Thr 


Ser 


Gly 


Pro 


Gly 


290 










85 










90 










95 




293 


Tyr 


Val 


Gin 


Leu 


Arg 


Ser 


Gly 


Gin 


Leu 


Val 


Gly 


Thr 


Ala 


Leu 


Met 


He 


294 








100 










105 










110 




* 


297 


Phe 


Cys 


Lys 


Glu 


Ser 


Cys 


Leu 


Pro 


Ser 


He 


Lys 


Asn 


Val 


Glu 


Gly 


Thr 


298 






115 










120 










125 








301 


Val 


Lys 


Lys 


Thr 


Gly 


Leu 


Gly 


Asn 


Lys 


Gly 


Ala 


Val 


Ala 


He 


Arg 


Phe 


302 




130 










135 










140 










305 


Asp 


Tyr 


Glu 


Asp 


Thr 


Gly 


Leu 


Cys 


Phe 


He 


Thr 


Ser 


His 


Leu 


Ala 


Ala 


306 


145 










150 










155 










160 


309 


Gly 


Tyr 


Thr 


Asn 


Tyr 


Asp 


Glu 


Arg 


Asp 


His 


Asp 


Tyr 


Arg 


Thr 


He 


Ala 


310 










165 










170 










175 




313 


Ser 


Gly 


Leu 


Arg 


Phe 


Arg 


Arg 


Gly 


Arg 


Ser 


He 


Phe 


Asn 


His 


Asp 


Tyr 


314 








180 










185 










190 






317 


Val 


Val 


Trp 


Phe 


Gly 


Asp 


Phe 


Asn 


Tyr 


Arg 


He 


Ser 


Leu 


Thr 


Tyr 


Glu 


318 






195 










200 










205 








321 


Glu 


Val 


Val 


Pro 


Cys 


He 


Ala 


Gin 


Gly 


Lys 


Leu 


Ser 


Tyr 


Leu 


Phe 


Glu 


322 




210 










215 










220 










325 


Tyr 


Asp 


Gin 


Leu 


Asn 


Lys 


Gin 


Met 


Leu 


Thr 


Gly 


Lys 


Val 


Phe 


Pro 


Phe 


326 


225 










230 










235 










240 


329 


Phe 


Ser 


Glu 


Leu 


Pro 


He 


Thr 


Phe 


Pro 


Pro 


Thr 


Tyr 


Lys 


Phe 


Asp 


He 


330 










245 










250 










255 




333 


Gly 


Thr 


Asp 


He 


Tyr 


Asp 


Thr 


Ser 


Asp 


Lys 


His 


Arg 


Val 


Pro 


Ala 


Trp 


334 








260 










265 










270 






337 


Thr 


Asp 


Arg 


He 


Leu 


Tyr 


Arg 


Gly 


Glu 


Leu 


Val 


Pro 


His 


Ser 


Tyr 


Gin 


338 






275 










280 










285 








341 


Ser 


Val 


Pro 


Leu 


Tyr 


Tyr 


Ser 


Asp 


His 


Arg 


Pro 


He 


Tyr 


Ala 


Thr 


Tyr 


342 




290 










295 










300 











W--> 267 
W--> 267 



<220> FEATURE: 
<223> OTHER IN 
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345 Glu Ala Asn lie Val Lys Val Asp Arg Giu Lys Lys Lys He Leu Phe 

346 305 310 315 320 

349 Glu Glu Leu Tyr Asn Gin Arg Lys Gin Glu Val Arg Asp Ala Ser Gin 

350 325 330 335 
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Use of <220> Feature (NEW RULES): 

Sequence (s) are missing the <22 0> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" 
or "Unknown" . Please explain source of genetic material in <220> to <223> 
section (S ee "Federal Register," 6/01/98, Vol, 63, No. 104 , pp. 29631-32) 
(Sec. 1.823 of new Rules) 

Seq# : 1 , 2 , 3 , 4 , 5 
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L:25 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:1 
L:27 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:1 / <213> 
ORGANISM: Artificial Sequence 

L:27 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:1, <213> 
ORGANISM : Art if icial Sequence 

L:27 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 1 , Line# : 27 
L:36 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:2 
L:38 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ# : 2 , <213> 
ORGANISM: Art if icial Sequence 

L:38 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ# : 2 , <213> 
ORGANISM Artificial Sequence 

L:38 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 2 , Line# : 38 
L:115 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 3 
L:117 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ# : 3 , <213> 
ORGANISM: Artificial Sequence 

L:117 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ# : 3 , <213> 
ORGANISM: Artificial Sequence 

L:117 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 3 , Line# : 117 
L:190 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:4 
L:192 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ# : 4 , <213> 
ORGANISM: Artificial Sequence 

L:192 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ# : 4 , <213> 
ORGANISM: Artificial Sequence 

L:192 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 4 , Line# : 192 
L:265 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:5 
L:267 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:5, <213> 
ORGANISM: Artificial Sequence 

L:267 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:5, <213> 
ORGANISM: Artificial Sequence 

L:267 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 5 , Line# : 267 
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